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ABSTRACT 


Networks  of  queues  are  important  models  of  multi  programmed  time- 
shared  computer  systems  and  computer  communication  networks.  Although 
equilibrium  state  probabilities  of  a broad  class  of  network  models  have 
been  derived  in  the  past,  analytic  or  approximate  solutions  for  response 
time  distributions  or  more  general  passage  time  distributions  are  still 
open  problems.  In  this  paper  we  formulate  the  passage  time  problem  as  a 
"hitting  time"  or  "first  passage  time"  problem  in  a Markov  system  and 
derive  the  analytic  solution  to  passage  time  distributions  of  closed 
queueing  networks.  Efficient  numerical  approximation  is  also  proposed. 

The  result  for  closed  queueing  networks  is  further  extended  to  obtain 
approximate  passage  time  distributions  for  open  queueing  networks.  Finally, 
we  employ  the  techniques  derived  in  this  paper  to  study  the  interfault  time 
and  response  time  distribution  and  density  functions  of  mul ti programmed 
computer  systems.  The  effects  of  program  behavior,  degree  of  multiprogramming, 
size  of  main  memory,  service  time  of  paging  devices  and  rate  of  file  I/O 
requests  on  the  shape  of  distribution  functions  and  density  functions  have 
been  examined. 


The  work  described  herein  was  supported  in  part  by  the  Ballistic  Missile 
Defense  Systems  Command  under  contract  no.  DASG60-77-C-0073.  Computer  time 
was  made  available  by  the  Stanford  Linear  Accelerator  Center. 


1.  INTRODUCTION 


Queueing  network  models  have  important  applications  in  computer  system 
and  computer  communication  network  modeling.  Work  on  tnis  application  in  the 
last  several  years  has  produced  a variety  of  models  to  capture  important 
aspects  of  computer  systems  and  computer  communication  networks.  A broad  class 
of  queueing  network  models  has  been  put  together  under  a uniform  framework  by 
Baskett,  Chandy,  Muntz  and  Palacios  [2]  and  further  extended  by  Kobayashi  and 
Reiser  [18]  [26].  Obtainable  from  such  analyses  are  stationary  queue  length 
distributions  and  measures  of  system  performance  derivable  from  them,  such  as 
mean  response  time  of  each  user  and  throughput  of  the  system.  Although  the 
average  response  time  or  waiting  time  can  be  obtained  easily  from  the  mean 
queue  length  based  on  Little's  formula,  "L  = XW"  , the  analytic  or  approximate 
solution  for  response  time  or  waiting  time  distribution  is  still  an  open  problem. 
Clearly,  the  distribution  function  is  much  more  informative  than  the  mean 
because  it  can  provide  us  with  higher  moments,  quantiles,  percentiles,  etc., 
i.e.  fluctuations  around  the  mean. 

Using  simulations  via  regenerative  techniques  to  obtain  interval  estimations 
of  passage  time  distributions  has  been  studied  by  Iglehart  and  Shedler  [14]  for 
closed  queueing  networks,  where  passage  time  is  defined  to  be  the  time  for  a 
job  to  traverse  a portion  of  the  network  [14],  hence,  a more  general  term  than 
response  time  or  waiting  time. 

In  section  2,  we  briefly  survey  the  history  on  the  development  of  the 
theory  of  queueing  network  models.  In  section  3,  we  summarize  the  queueing 
network  models  where  analytic  results  are  available  and  the  results  on  steady 
state  probabilities  of  the  queueing  networks.  In  section  4,  we  derive  the 
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passage  time  distributions  of  the  closed  queueing  networks  specified  in 
section  3 using  the  concept  of  "hitting  time"  or  "first  passage  time"  of 
Markov  processes.  In  section  5,  we  use  a multi  programmed  computer  system 
model  which  is  a two  stage  cyclic  queueing  network  as  an  example  to  go  through 
all  the  details  involved  in  calculating  a passage  time  distribution.  In 
section  6,  an  efficient  numerical  technique  to  obtain  approximate  passage  time 
distributions  of  the  closed  queueing  networks  is  proposed.  The  approximate  solutions 
are  very  satisfactory.  In  section  7,  we  examine  the  open  queueing  networks  specified 
in  section  3 and  extend  the  results  on  the  closed  queueing  networks  to  obtain 
approximate  passage  time  distributions  for  the  open  networks.  The  major 
difficulty  encountered  in  handling  the  open  queueing  networks  is  the  infiniteness 
of  its  state  space  as  we  shall  see  later.  In  section  8,  we  employ  the  derived 
techniques  to  study  interfault  time  distributions  and  resportse  time  distributions 
of  multi  programmed  computer  systems.  Finally,  in  section  9 we  draw  the 
conclusion. 
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2.  SURVEY 


A good  survey  on  the  development  of  the  theory  of  queueing  network 
models  up  to  1972  is  contained  in  Muntz  and  Baskett  [22].  Here,  we  excerpt 
the  contents  from  [22]  and  fill  in  the  development  thereafter.  For  some  ten 
years,  the  most  general  class  of  queueing  networks  for  which  an  analytic 
solution  was  known  is  treated  by  Jackson  [15].  Jackson  develops  the  equilibrium 
distributions  of  the  states  of  the  following  class  of  networks: 

(1)  All  the  service  time  distributions  are  exponential 

(2)  All  the  customers  are  identical 

(3)  A customer  leaving  a service  center  can  choose  the  next  service 
center  according  to  a fixed  set  of  branching  probabilities  for 
the  service  center  being  left 

(4)  External  arrivals  and  departures  are  allowed.  The  external 
arrival  process  is  assumed  to  be  Poisson. 

Gordon  and  Newell  [13]  treat  the  similar  kind  of  queueing  networks  and  make 
clear  the  product  form  of  the  solution  of  the  balance  equations  describing  the 
steady  state  of  the  model.  Properties  (1)  and  (2)  of  this  class  of  networks 
are  clearly  limitations  of  its  applications.  In  the  past  few  years,  applications 
of  such  queueing  networks  to  the  modeling  of  computer  systems  have  been  drawing 
increasing  attention,  and  the  extensions  of  the  above  results  to  more  general 
queueing  networks  have  been  attempted  by  various  authors.  We  summarize  their 
results  in  the  following: 

Ferdinand  [12]  analyzed  a particular  queueing  model  called  the  finite 
source  model  or  the  machine  repairman  model.  The  system  was  a cyclic  model 
with  two  service  centers  and  consists  of  a fixed  number  of  customers.  One 
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service  center  which  can  be  viewed  as  the  CPU  is  shared  by  all  waiting  customers 
simultaneously  at  a service  rate  inversely  proportional  to  the  number  of  customer- 
being served.  The  other  service  center  which  can  be  viewed  as  the  terminals 
consists  of  a sufficient  number  of  servers  so  that  no  queueing  occurs.  These 
two  types  of  service  centers  will  be  referred  to  as  processor  sharing  and  infinite 
server  stations,  respectively.  Different  classes  of  customers  are  allowed,  i.e. 
each  customer  has  its  own  pair  of  exponentially  distributed  service  times,  one 
for  each  service  center.  Posner  and  Bernholz  [14]  consider  the  more  general 
network  model  of  Gordon  and  Newell  where  each  customer  has  its  own  set  of 
branching  probabilities,  and  exponentially  distributed  service  times. 

The  network  is  closed  and  two  kinds  of  service  models  are  allowed:  FCFS  (first- 
come  first-served)  and  processor  sharing.  Only  under  the  processor  sharing 
discipline,  different  customers  may  have  different  service  time  distributions. 

Sakata,  Noguchi,  and  Oizumi  [27]  discovered  that  when  processor  sharing 
scheduling  was  applied  to  the  classical  infinite  source  queueing  model  (M/G/1), 
the  equilibrium  distribution  of  queue  sizes  for  the  model  was  the  same  as  that  j 

for  a similar  model  with  exponentially  distributed  service  times  with  the  same  | 

mean  as  the  original  general  distribution.  Baskett  [1]  derived  a similar  result 
for  a finite  source  model  in  which  the  service  time  distribution  at  both 
service  centers  have  rational  Laplace  transforms  and  Baskett  and  Palacios  [4] 
extended  that  result  to  the  central  server  network  model  which  Buzen  [5]  has 
studied.  The  equilibrium  solutions  have  the  product  form.  The  model  includes 
FCFS,  processor  sharing  and  infinite  server  types  of  service  centers  and  the 
service  time  distributions  can  be  any  distributions  with  rational  Laplace  trans- 
forms for  the  two  latter  types  of  service  centers.  But  the  queueing  system  is 
closed  and  only  a single  class  of  customers  is  allowed. 
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Whittle  [28]  [29]  showed  that  the  balance  equations  describing  the  under- 
lying birth  and  death  processes  could  be  replaced  by  sets  of  "independent" 
balance  equations.  Chandy  [8]  extended  this  technique  to  more  complex  models 
and  extended  the  range  of  networks  for  which  product  form  solutions  can  be 
found.  Chandy  developed  the  solution  for  networks  in  which  the  service  center 
is  of  FCFS,  processor  sharing  or  LCFS  type  and  in  which  all  customers  are  the 
same.  Palacios  [23]  independently  developed  solutions  for  a particular  network 
with  "types"  of  customers.  In  [9],  Chandy,  Keller  and  Browne  further  extended 
the  concept  of  customer  "type"  and  added  the  concept  of  customer  "mode"  for 
general  networks. 

The  recent  most  noteworthy  progress  in  extending  the  class  of  analytically 
solvable  queueing  networks  has  been  done  by  Baskett,  Chandy,  Muntz  and  Palacios 
[2].  These  authors  have  succeeded  in  casting  into  a unified  theory  of  previously 
known  but  unconnected  results  such  as  queue  size  distributions  for  M/M/1  with 
FCFS  discipline,  general  service  time  distribution  for  processor  sharing,  infinite 
server  discipline  and  pre-emptive  --resume  LCFS  discipline  and  queueing  systems 
with  various  classes  of  customers.  Reiser  and  Kobayashi  [26]  generalize  the 
result  of  [2]  to  the  case  in  which  customer  transitions  are  characterized  by 
more  than  one  closed  Markov  chain.  The  technique  of  generating  function  has 
been  applied  to  obtain  closed  form  solutions.  Kobayashi  and  Reiser  [18]  further 
extend  the  job  routing  behavior  to  high  order  Markov  chain,  i.e.  the  transition 
probability  of  a job  from  one  station  to  the  other  can  depend  on,  at  least, 
the  last  two  stations  it  has  visited  and  not  just  the  last  one.  In  Lam  [20], 
the  class  of  queueing  networks  with  a product  form  solution  is  extended  to  include 
state  dependent  lost  arrivals  and  trigger  arrivals.  Such  queueing  network  models 
can  be  used  to  model  store  and  forward  packet  switching  nodes  and  multiprogramming 
computer  systems  with  storage  constraint. 
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3.  EQUILIBRIUM  STATE  PROBABILITIES  OF  QUEUEING  NETWORKS 


In  this  section,  we  examine  the  class  of  queueing  network  models  whose 
equilibrium  state  probabilities  have  been  obtained  by  Baskett,  Chandy,  Muntz 
and  Palacios  [2]  under  a uniform  framework.  Their  results  will  serve  as  the 
basic  foundation  for  our  subsequent  analysis  of  passage  time  distributions. 

The  queueing  network  can  have  any  kind  of  topologies  and  any  number  of 
service  centers.  The  customers  may  have  different  classes.  We  will  assume 
that  the  queueing  network  consists  of  M service  centers  and  R classes  of 
customers.  At  any  time,  each  job  can  only  be  in  one  job  class,  but  it  may 
change  class  as  it  traverses  through  the  network.  Upon  completing  service  at 

center  i,  a job  of  class  j is  routed  to  center  k and  changes  to  class  I with 

R R R 

probability  P . . ...  Furthermore,  the  routing  matrix  P = {P..  . k<  M,l< j ,f 

1 J » K>C  1 J 5 

can  be  considered  as  defining  a Markov  chain  whose  state  space  is 
( (i  >j  ) » 1 1 i £M  , 1 £ j £ R}  and  transition  matrix  is  P . 

The  most  general  service  time  distribution  considered  is  the  one  which 
has  a rational  Laplace  transform.  All  exponential,  hyperexponential  and  hypo- 
exponential  distributions  have  rational  Laplace  transforms.  Cox  [10]  has  shown 
that  any  distribution  with  a rational  Laplace  transform  can  be  represented  by 
a network  of  exponential  stages  of  the  form  shown  in  Fig  3.1. 


Fig  3.1  Representation  of  service  time  distributions  by  the  method 
of  stages 
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In  Fig  3.1,  1-a.  is  the  probability  that  the  customer  leaves  the  service 
center  after  the  i-th  state  and  a^.  is  the  probability  that  the  customer  goes 
to  the  next  stage.  The  service  time  at  stage  i has  an  exponential  distribution 
with  mean  — . 

The  service  centers  of  the  network  can  be  any  of  the  following  four  types. 
Type  1:  The  service  discipline  is  first-come  first-served  (FCFS).  All 

customers  must  have  the  same  service  time  distribution  regardless 
of  its  job  class. 

Type  2:  There  is  a single  server  at  the  service  center,  the  service  discipline 

is  processor  sharing.  The  processing  rate  is  reduced  to  1/r  if  n is 
the  number  of  customers  sharing  the  service.  Each  class  of  customers 
may  have  a distinct  service  time  distribution.  The  service  time  can 
have  any  distribution  with  a rational  Laplace  transform. 

Type  3;  The  number  of  servers  in  the  service  center  is  greater  than  or  equal 
to  the  maximum  number  of  customers  that  can  be  queued  at  this  center. 
This  is  call  the  infinite  server  model.  Each  class  of  customers  may 
have  a distinct  service  time  distribution.  The  service  time  can  have 
any  distribution  with  a rational  Laplace  transform. 

Type  4:  The  station  contains  a single  server,  the  queueing  discipline  is  pre- 

emptive-resume last-come  first-served  (LCFS).  Each  class  of  customers 
may  have  a distinct  service  time  distribution.  The  service  time  can 
have  any  distribution  with  a rational  Laplace  transform. 

Notice  a type  2 service  center  is,  in  many  cases,  a reasonable  representat 
of  the  central  processing  unit  allocating  small  quanta  to  jobs  requiring  service 
in  round  robin.  Actually,  it  is  the  limiting  case  of  round  robin  scheduling 


where  the  size  of  the  quanta  goes  to  zero  and  no  overhead  is  associated  with 
job  switching.  A type  3 center  is  often  used  to  represent  the  terminals  in  a 
time  sharing  system.  Type  1 service  centers  are  most  commonly  encountered,  e.g 
in  modeling  secondary  storage  I/O  devices,  channels  in  point  to  point  computer 
communication  networks,  etc.  A type  4 service  center  may  also  be  used  to  model 
CPU  since  it  is  an  efficient  preemptive  scheduling  algorithm. 

We  now  proceed  to  examine  the  state  of  the  network  model.  The  state  can 
be  represented  by  a vector 

'V  /b  'b  V 

S - ( X-|  , . . • , X|^ ) 

% 'V/ 

where  x^.  represents  the  state  at  service  center  i.  The  exact  form  of  x^. 
depends  or  the  type  of  service  center  i.  We  summarize  the  forms  of  x^'s  below: 
Type  1 service  center; 

X.  = (x., , . . . , X.  1 
^ 1 1 1 n^- 

x^j  is  the  class  of  the  j-th  customer  at  station  i in  FCFS  order  and  n^.  is  the 
number  of  customers  at  station  i. 

Type  2 or  3 service  center: 

Xi  - (v.^,  v-2,  ....  v.p) 

v^^  is  a vector  (m^-^-|,  "’i ^2’  • • • > '^■j ) and  the  £-th  component  of  v.^  is 

^ir  ’ 

the  number  of  customers  of  class  r in  center  i and  in  the  £-th  stage  of  service 
is  the  number  of  stages  of  the  service  time  for  a class  r customer  at 
service  center  i. 


Type  4 service  center; 


X . 


= (( 


il 


"il)« 


(ri2. 


^2^ 


(r, 


in^  in^ " 


r^.j  is  the  class  of  the  j-th  customer  in  LCFS  order  and  ni_  is  its  stage  of 


service. 

Let  S(t)  be  the  stochastic  process  describing  the  state  of  the  network 
model  and  E be  the  collection  of  all  permissible  state  s's  described  above. 
Propositon  3.1 

•V  % % 

S(t)  = S»(t))forms  a continuous  time  Markov  chain  with  state 

* Op  ” 

space  E = {s},  where  S^.  (t)  describes  the  state  of  the  i-th  service  center  at 
time  t. 


Before  presenting  the  solution  to  the  class  of  networks  described  jcove, 

we  need  to  define  a set  of  terms  that  appear  in  the  solution.  Recall 
R R 

P = {P^^  jg}  is  the  routing  matrix  and  defines  a Markov  chain.  This  Markov 
chain  is  assumed  to  be  decomposable  into  f*  ergodic  subchains.  Let  E^ , E^,  .... 
be  the  sets  of  states  in  each  of  these  subchains.  For  each  ergodic  subchain 
E|^,  we  define  the  following  set  of  equations 


H.rtcE,  'ir  V.js  *"0. 

The  value  of  is  determined  by  the  rate  of  exogenous  arrivals  of  class 
s customer  to  service  center  j.  Notice  if  q^^  = 0 M U,s)  eE^, 
is  closed  with  respect  to  E.  . In  this  case,  e.  can  only  be  determined  to 

K I ' 

within  a multiplicative  constant  and  interpreted  as  the  relative  arrival  rate 
of  class  r customer  to  service  center  i.  If  not  all  q.  = 0 for  (j,s)cE.  , 

J S K 

then  we  assume  a unique  solution  fore.  . In  this  case,  e.  is  the  absolute 

1 r 1 r 

arrival  rate  of  class  r customers  to  service  center  i. 


One  further  definition  is  required.  If  at  the  i-th  service  center  the 
customers  of  class  r have  a service  time  distribution  that  is  represented  as  a 
network  of  stages,  then  it  will  be  represented  as  shown  in  Fig  3.2. 
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Note: 

(1)  the  first  subscript  denotes  the  service  center 

(2)  the  second  subscript  denotes  the  class  of  the  customer 

(3)  the  third  subscript  denotes  the  stage  of  service 

Fig  3.2  Representation  of  the  service  time  distribution  of  a class 
r customer  at  service  center  i 

Let  A.  „=  a.  i.e.  A.  „ is  the  probability  that  a class  r 

customer  at  station  i will  reach  its  j-th  stage  of  service.  Finally,  we 
state  the  equilibrium  state  probabilities  in  the  following  theorem  which  can 
be  proved  by  checking  that  the  independent  balance  equations  are  satisfied. 

Theorem  3.1: 

For  a network  of  service  stations  which  is  open,  closed  or  mixed  in  which 
each  service  center  is  of  type  1,  2,  3 or  4,  the  equilibrium  state  probabilities 
are  given  by 

P (x^,  x^,  ...,  x^^)  = cd  (s)  f^  (x^)  f2lx2)  ...  f|v,iX|^) 

where  c is  a normalizing  constant  chosen  to  make  the  equilibrium  probabilities 
sume  to  1 , 

s IS  an  abbr.  of  (x.| , X2,  . . . , x^^)  , 
d(s)  is  a function  of  the  number  of  customers  in  the  system. 
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and  each  f^.  is  a function  that  depends  on  the  type  of  the  service 
center  i. 


To  be  more  specific,  for 
Type  1 service  center: 


f.(^.)=  (i)"’  e. 

11  J=1  ix^j 


Type  2 service  center: 


f lx  ) = n ' _1 

TilXi;  n..  11  11^^^  1 L ^ 


irt: 


p A m 

'y  X „R  „ ir  ,r  ir  ir£n  ir£  1 


m 


-r  } 


^rl 


Type  3 service  center: 


ir£ 


m.  „! 
trl 


Type  4 service  center: 


f.(x.)  = 
1 1 ' 


n . ^i r . . ^i r . •ill-  • 
n.i  f _y !J_iL] 

^ir.  .m. . 

ij  iJ 


If  the  arrivals  to  the  system  depend  on  the  total  number  of  customers 

in  the  system,  denoted  by  M(s),  and  the  arrivals  of  class  r customers  to  center 

/\ 

i follow  fixed  probability  then 

-ft)  - x(i) 

where  A(i)  is  the  arrival  rate  when  the  total  number  of  customers  is  equal  to  i. 

Another  case  of  interest  will  be  the  case  where  the  arrival  process  consists 
of  m Po’sson  arrival  streams  corresponding  to  the  m ergodic  subchains  mentioned 
before.  Let  ^j(i)  be  the  instantaneous  mean  arrival  rate  for  the  j-th  stream 
when  the  total  number  of  customers  in  the  j-th  subchain  is  equal  to  i and  M(s/Ej) 
denote  the  number  of  customers  in  the  j-th  subchain  when  the  state  of  the  system 
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is  s , then 

<Hh  = Xj(i) 

If  the  network  is  closed,  then 
d(s)  = 1 

Now  let  us  consider  the  marginal  distribution  of  queue  lengths.  Define 
an  aggregate  system  state  as  the  number  of  customers  of  each  class  in  each 

'\x 

center.  More  formally,  an  aggregate  state  W of  the  system  is  given  by 

(y-l*  Y2»  •••  yf/|)  where  y^  = (n.^,  with  denoting  the  number 

of  customers  of  class  r in  service  center  i.  Let  — ^ be  the  mean  service 

^ir 

time  of  a class  r customer  at  service  center  i. 

Theorem  3.2 

% 

The  equilibrium  distribution  of  the  aggregate  state  W = (y.| , ...,  y^^) 
is  given  by 

PIW  = (y^,  ....  = cd(Wl  g^G^)  ... 

where 

R 1 n.  , n. 

for  Type  1 service  center:  9^- (y^ ) = n . ! {n  (e^.^)  ^ 

i r ' i 

Type  2 or  4 service  9,- (y,- ) = n .!  n 

center:  ’ ’ ^ ' "ir  ^ir 

^ R 1 e.  n. 

Type  3 service  center:  g^iy.:)  = n 

ir‘  ^ir 

The  most  obvious  implication  of  this  theorem  is  that  the  equilibrium 
distribution  of  the  aggregate  state  W depends  only  on  the  means  of  the  service 
time  distributions. 

A further  simplification  is  possible  if  the  network  is  open  and  the 
arrival  process  does  not  depend  on  the  state  of  the  model.  Let  ?»  = ln.|  ,0^, . . . ,0^^) 
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be  the  aggregate  state  which  represents  the  total  number  of  customers  in  each 


service  station.  Let  = {r:  class  r customers  may  require  service  at 
service  center  i}.  Furthermore,  let 

^i  ” ^rtR^  service  center  i is  type  1 

p.  = E n A 

r c R^ 


— \ 

V‘i^}  if  service  center  i is  type  2,3  or  4. 


Theorem  3.3 

The  equilibrium  distribution  of  the  aggregate  state  ii  = (n^,...,nj^) 
of  an  open  network  with  state  independ^fent  arrival  rate  is  given  by 

M 

P (?i  = (n,,...,n J)  = n P (n.) 

I h 1 1 


where 


n . 


P.(n.)=(l-p.lp.^  . . . . , _ . 

TV  ^11  if  service  center  is  type  1,  2,  or  4 

n . 

-Pi  P.  1 

) = e ( j ) if  service  center  is  type  3 

Furthermore,  various  forms  of  state  dependent  service  rates  can  easily 
be  incorporated  into  the  network  models.  We  examine  the  following  three  cases 


Case  1:  The  service  rate  at  a service  center  depends  on  the  total  number  of 

customers  at  that  service  center.  Let  v^(n^)  be  the  rate  of  service 

at  the  i-th  service  center  when  there  are  n.  customers  at  that 

service  center  relative  to  the  service  rate  when  n^.  = l.  Then  fj(Xj) 

in  theorem  3.1  becomes  (y"’v^(a))  . ihis  form  of  state 

a=l 

dependent  service  rate  is  very  useful.  Consider  the  case  when  the 
i-th  service  center  contains  k^.  multiple  servers,  we  can  let 


n.  , 1 < n.  < k. 
"'kj  , n,>k, 


Cases  where  j is  a general  function  of  n^.  can  be  found  in 

Kobayashi  [19]. 

Case  2:  The  service  rate  of  a class  r customer  at  service  center  i depends 

on  the  number  n.  of  class  r customers  at  service  center  i.  This 
^r 

form  of  state  dependent  service  rate  can  not  be  modeled  for  type  1 

service  centers.  Let  y^^(n^^)  be  the  service  rate  of  class  r 

customers  at  service  center  i relative  to  the  service  rate  when  there 

is  only  one  class  r customer  at  service  center  i.  In  this  case, 

R n 


n M-jp 

f.(x. ) is  replaced  by  f- (x.)  n n (1/  (g)  ) 

’ ^ 1 1 r=l  a=l  '^ir 


Case  3:  The  state  dependent  service  rate  involves  the  number  of  customers 

in  several  service  centers.  Let  I = {i^.  .i^.- • • 

be  a subset  of  the  service  centers.  Let  n,  = Z n.  and 

^ iel  ’ 

let  2j(nj)  be  the  relative  service  rate  to  customers  in  the  subset  I of 

service  centers  relative  to  the  service  rates  when  n,  is  one. 

ni  I 

In  this  case  IT  f.  (x.)  becomes  II  n (l/i  (ai) 

id  ' ' id  ' ' ad 

Finally,  we  note  that  these  various  forms  of  state  dependent  service 

rates  can  be  combined. 
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4.  ANALYTIC  SOLUTION  ON  PASSAGE  TIME  DISTRIBUTIONS  FOR  THE  CLOSED  QUEUEING 
NETWORKS 

In  this  section,  we  obtain  the  distribution  functions  of  passage  times 
for  the  closed  queueing  networks  specified  in  the  previous  section.  That  is 
to  say  the  closed  queueing  networks  can  have  any  of  the  four  types  of  service 
centers  and  different  classes  of  customers  whose  transitions  follow  some  routing 
chain.  The  passage  time  considered  is  the  time  required  for  a job  to  reach  a 
specified  destination  from  a given  start  where  certain  restrictions  may  be  put  on 
the  passage.  A formal  definition  will  be  given  later.  Response  time  is  taken 
to  be  the  time  measured  between  the  arrival  instant  of  a customer  and  its 
departure  instant  for  open  queueing  networks  and  the  time  required  for  a job 
to  go  through  a complete  circuit  or  a loop  for  closed  queueing  networks.  As  we 
shall  sec  that  response  time  is  actually  a special  kind  of  passage  time.  The 
reason  why  we  restrict  ourselves  to  closed  networks  in  this  section  is  simply 
because  the  state  space  of  a closed  queueing  network  is  finite.  The  advantage 
of  the  finite  state  space  will  be  aparent  later  on. 

We  will  first  introduce  the  concept  of  a tag  job.  Concentrating  on  the 
behavior  of  the  tag  job  as  it  traverses  through  the  network  provides  us  a means 
to  evaluate  the  passage  time  distribution.  Each  job  is  assumed  to  have  equal 
probability  of  being  tagged.  In  order  to  keep  track  on  the  position  of  the 
tag  job,  we  need  to  augment  the  state  variable  described  in  section  3 by  an 
extra  component  which  describes  the  position  of  the  tag  job  in  the  network. 
Examining  the  state  variable,  we  see  that  in  the  FCFS  and  LCFS  service  station, 
there  is  a one  to  one  correspondence  between  the  state  components  and  the  jobs 
in  the  system.  This  component  specifies  both  the  position  (implicitly)  and 
the  class  of  the  corresponding  job.  It  also  specifies  the  stage  of  service 
if  the  center  is  LCFS.  By  letting  K(t)  denote  the  index  of  the  state  component 
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i , 


I 


i 


describing  the  tag  job  at  time  t,  we  can  identify  the  tag  job  position  and 
status  unambiguously.  But,  for  the  processor  sharing  and  infinite  server 
service  centers,  a one  to  one  correspondence  between  state  components  and  jobs 
does  not  exist.  All  jobs  in  the  same  job  class  and  stage  of  service  will 
be  described  by  a single  state  component,  i.e.  they  are  indistinguishable 
under  the  state  description.  Since  we  are  only  interested  in  the  stochastic 
behavior  of  the  tag  job,  this  ambiguity  really  doesn't  matter.  This  ambiguity 
only  elaborates  a little  bit  the  specification  of  the  transition  matrix  as 
we  shall  see.  We  will  still  let  K(t)  indicate  the  index  of  the  corresponding 

state  component  in  the  state  variable.  Let  S(t)  be  the  stochastic  process 

a. 

defined  in  proposition  3.1  and  (S(t),K(t))  be  the  new  stochastic  process  just 

described,  and  let  E and  E'  be  the  corresponding  state  spaces,  respectively. 

Recall  (S(t)  } forms  a continuous  time  Markov  chain.  Let  {T^}  be  the  jump 

times  of  the  Markov  chain,  with  0 = T» < T,  < . . . . Clearly  S(T.  ) is  related 

^ 0 0 

to  through  the  transition  matrix,  P = o'f  S(t).  If  the  tag 

job  is  a FCFS  or  LCFS  station,  at  the  service  completion  instant,  i.e.  the 
jump  time  of  the  underlying  Markov  chain,  K(T.  ) will  be  uniquely  determined 
by  the  S(Tj^)  and  (S(T|^_^1,  K(T|^_^)).  That  is  to  say  there  is  a one  to  one 
correspondence  between  the  possible  next  states  of  S(T|^  .j)  and  (S(T|^  -j),  K(T|^  .|)) 
under  the  same  transition  probabilities.  If  the  tag  job  is  in  a processor 
sharing  or  infinite  server  station,  this  is  no  more  true.  For  example,  assume  q 
jobs  of  the  same  class  are  in  the  same  stage  of  a processor  sharing  or  infinite 
server  station  and  one  of  them  is  the  tag  job.  At  the  next  service  completion 
instant  of  this  stage  of  the  service  center,  one  of  the  q jobs  changes  its 

status,  either  moving  to  a different  station  or  a different  stage  of  the  current 

a. 

service  center.  K(Tj^)  will  depend  upon  not  only  the  value  of  S(Tj^)  and 
(S(T|^  .|),  K(T|^_^))  but  also  the  fact  whether  the  tag  job  is  the  job  just 


} 


completing  its  service.  Nevertheless,  the  service  completion  of  the  tag  job 
can  be  determined  stochastically.  Since  the  probability  that  the  tag  job  completes 
its  service  at  this  instant  is  known  which  is  equal  to  l/q,we  can  incorporate 
the  uncertainty  on  K(T.  ) into  the  transition  probabilities  of  the  stochastic 
process  (S(t),  K(t)),  and  these  transition  probabilities  only  depend  upon  the 
current  state  (S(T|^  K(Tj^_^)).  Therefore,  we  conclude: 

Proposition  4.1 

a- 

Z(t)  = (S(t),  K(tJ)  forms  a continuous  time  Markov  chain 

In  order  to  define  the  passage  time  considered  formally,  we  need  to  intro- 
duce four  subsets  of  E',  (A^ ,A2,B^ ,62)  which  will  tell  us  in  effect  the  start  time 
and  stop  time  of  a particular  passage  time  of  the  tag  job  as  we  observe  the  sample 
path  of  the  stochastic  process.  A2  and  B2  are  the  sets  of  state  where  passage 
times  may  start  or  stop,  respectively.  Knowing  that  the  state  of  the  stochastic 
process  is  in  or  B2  is  not  sufficient  to  conclude  that  a passage  time  starts 
or  stops  in  general  under  path  restrictions.  Path  restrictions  will  be  allowed 
if  appropriate  A.|  and  B-j  can  be  defined  such  that  the  start  or  stop  of  a 
passage  time  can  be  determined  from  the  additional  fact  that  the 
process  has  also  passed  through  some  state  in  A-j  or  B.j,  respectively.  In  the 
case  where  disjoint  and  the  passage  time  terminates  at 

the  first  time  the  stochastic  process  hits  some  state  in  B2.  the  set  B^  will 
be  redundant.  Similar  remark  holds  for  A^  We  next  define  two  sequences  of 

random  times,  {S.  : j > 0}  and  {r.  : j>l},  where  S.  , is  the  start  time  of 

J J J ' 

the  j-th  passage  time  and  r.  is  the  termination  time  of  the  j-th  passage  time. 

J <\j 

Assume  that  the  initial  state  of  the  Markov  chain  {Z(t)  ; t >0}  is  such  that 
a passage  time  for  the  tag  job  begins  at  t=0.  Formally, 


% 'V 

Sj  = inf  { : z(T^)  e 1^2'  ^ ^1 

some  and  k < nl  , j^l 


17 


t.  = inf  { : 2(T^)  e B^,  2(T|^)  e 

for  some  T.  > S . , and  k < n } , j > 1 
K J - I 

Then  the  j-th  passage  time  is  simply  PS.  = r.-S.  ,,  In  the 

J J J I 

case  of  response  times,  we  will  have  A.|  = B.j , A2  = B2  and  consequently 

S.  = r.  for  all  j^l.  This  formal  definition  of  passage  time  is  similar  to  but 

J J 

somewhat  more  restrictive  than  that  in  Iglehart  and  Shedler  [14]. 

Now  we'll  try  to  convert  the  above  passage  time  problem  into  the  hitting 

time  or  first  passage  time  problem  of  another  stochastic  process,  I (tl.  The 
% 

I (t)  should  have  the  following  properties 

■k 

(1)  The  set  of  possible  initial  states,  A2  , should  be  isomorphic 

to  the  set  A2  which  may  start  a passage  time  of  Z(t).  Furthermore,  the  initial 

★ 

distribution  at  each  state  in  should  equal  to  the  stationary  probability 
that  Z[t)  may  start  the  passage  time  from  its  corresponding  state  in 

(2)  The  process  Z (t)  has  an  absorbing  state.  When  the  passage  of  Z(t) 

% 

terminates,  2 (t)  should  hit  the  absorbing  state  simultaneously. 

First  of  all,  let  us  add  an  extra  component,  I(t),  to  the  state  variable 

% 

(S(t),  K(t)).  That  is  to  say  the  new  state  variable  will  be  the  vector 

'\i  a. 

Y(t)  = (S(t),  K(t),  I(t)),  t>0 

where  I(t)  is  used  to  indicate  whether  during  the  previous  state  transitions, 

★ * 
any  state  in  B.|  has  been  passed  through.  Here  we  let  B^  be  the  direct 

extensions  of  B,  with  the  extra  component  setting  to  zero,  i.e. 

* 'V 

B^  = {(z,0)  : zeB.|}.  We  will  further  assume  the  process  Y(t)  starts  with 
some  state  U«0)  at  t=0  where  zeAp.  Since  I(T.  ) can  be  completely  determined 

-V  z K 

by  Y(T|^_.|),  we  conclude  that 

Proposition  4.2 

Y(t)  is  a continuous  time  Markov  chain  with  the  following  properties: 
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% 


(IJ  Y(0)eA, 


where  A,  = {(z,OJ  : zeA,} 


' 0 for  i=0 


’’(z^.0)(z2,0)  = {%'^Z 

0 

' 0 


for  z^  ^ 
for  z^  e 

for  e 
for  z^  ij,  B^ 


where 


{^(z^  )^Z2.''2^^  is  the  transition  matrix  of  Y(t) 

and 

{^1,^2}  is  the  transition  of  matrix  of  Z(t) 

From  properties  (1)  and  (2),  it  is  apparent 


Y(t)  = { 'V- 


U(t),0),  if  Z(T.  ) ^:  B 


for  aH0<T,<T  ,<t<T 
k n-1  - n 


(Z(t),l),  if  Z(T|^)  e B^  for  some  0 < T|^  < < t<T^ 


If  82  and  B^  are  disjoint,  the  number  of  states  reachable  from  A2  may  be 
reduced  if  we  modify  the  definition  of  the  transition  matrix  on  the  state 
(z^ ,0)  to  be 


' “r  "2f“i 

0 otherwise 


and 


for  z 


l-j  or  z^  and  Z2  e B^ 


otherwise 


for  every  Z^  e E. 
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The  difference  between  the  two  definitions  is  that  the  component  I(t)  will 

* 

not  be  set  to  one  until  the  process  Z(t)  leaves  the  set  for  the  first 
time  under  the  modified  definitions. 

★ 

Let  us  define  B2  to  be  the  direct  augmentation  of  B£  by  setting  the 
extra  component  to  1,  i.e.  B~  = {(z,l):ze  B«}.  We  further  modify  the  state 
space  of  the  stochastic  process  Y(t)  by  lumping  all  the  states  in  B^  into  a 
single  absorbing  state  r.  Clearly  the  new  stochastic  process  satisfies 

a. 

property  (2)  of  I (t)  and  for  each  state  Z-j  in  of  Z(t),  there  is  a state 

.f\j  ^ 

(z-|,  0)  in  Ap  corresponding  to  it.  That  is  to  say  the  passage  time  of 

Z(t)  will  have  the  same  distribution  as  that  of  the  hitting  time  to  state  r 

of  this  new  stochastic  process  under  an  appropriate  initial  distribution.  We 

% 

will  call  the  new  stochastic  process  Z (t)  and  the  corresponding  state  space 

★ 

E . 

As  pointed  out  earlier,  if  A2  and  B2  are  disjoint  and  the  passage  time 

terminates  at  the  first  time  that  the  stochastic  process  hits  some  state  in 

B2.  the  set  B^  is  redundant.  In  this  case,  the  extra  component  Ht)  is  also 

redundant.  Lumping  the  states  in  Bp  into  an  absrobing  state,  we  get  the 

% 

desired  stochastic  process  Z (t).  Even  if  A„  and  B„  are  not  disjoint,  we 

may  sometimes  still  be  able  to  uniquely  identify  each  state  of  Z (t)  after 

dropping  the  last  component  I(t).  In  this  case,  I(t)  is  a conceptual  tool  to 
% 

help  us  define  Z (t).  For  example,  if  the  last  stage  of  the  passage  contains 
a critical  service  center  which  can  not  be  reached  by  the  tag  job  until  the 
end  of  the  passage  time,  I(t)  can  be  dropped  at  the  final  step  even  if  the 
passage  is  a loop.  Nevertheless,  in  the  general  case,  I(t)  is  required. 

Fruthermore,  the  definition  of  passage  time  can  be  generalized  in  the 
following  way.  We  can  modify  the  definition  of  I(t)  so  that  it  will  not  be 


[4 


i.r 

I . 


'K, 

set  to  1 until  the  underlying  process  Z(t)  not  only  passes  through  some  state 
in  B-|  but  also  transits  to  appropriate  next  states  when  it  departs  from 
that  state  in  B, . The  setting  of  I(t)  can  also  be  nondeterministic.  The  rule 
can  be  that  when  the  underlying  process  Z(t)  jumping  from  state  b in  to 
state  d,  I(t)  will  be  set  to  1 with  probability  g.  That  is  to  say  (d  1) 

gPj^j  and  P^|^  qj  = (1-g)  P|^j  . An  example  of  this  type  will  be  that  a 

passage  time  terminates  at  the  time  it  transits  from  server  i to  server  j with 
probability  g.  In  this  case,  we  can  choose  B^  to  be  the  set  of  all  states 
corresponding  to  the  tag  job  being  served  at  station  i.  At  the  next  state 
transition  of  Z (t),  the  component  I(t)  will  be  set  to  1 with  probability  g if 
the  state  transition  is  due  to  the  transition  of  the  tag  job  to  station  j. 
Similar  generalization  holds  for  A^  and  A^. 


We  now  proceed  to  evaluate  the  appropriate  initial  distribution  for 
Z (t).  First  we  need  to  find  the  steady  state  distribution  of  Z(t)  = (S(t),  K(t)). 

The  steady  state  distribution  of  S(t)  is  given  in  the  previous  section.  Let 

{P(s)}  be  the  steady  state  distribution  of  S(t)  and  {P(s,k)}be  that  of  Z(t). 

Assume  the  total  number  of  jobs  in  the  network  is  N.  Clearly,  each  job  has 

probability  being  taged.  If  the  tag  job  is  in  a FCFS  or  LCFS  service 
center,  the  steady  state  distribution  at  state  (s,k)  will  be 

P(s,k)  = 1 P(s)  (4.1) 

and  if  the  tag  job  is  in  a processor  sharing  on  infinite  server  center,  the 
steady  distribution  at  state  (s,k)  will  be 


P{s,k)  = ^1^^’*^^ 


P(s 


(4.2) 


where  N^(s,k)  is  the  number  of  jobs  in  the  same  class  and  stage  of  service 
as  the  tag  job. 


1 

i 

; I 
1 i 

■ 1 

I 

In 
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To  evaluate  the  initial  distribution  of  Z (t),  we  first  consider  the 
coimion  case  where  each  state  (u,i)  in  can  transit  directly  to  some 
states  in  start  a new  passage  time^.  The  infinitesimal  transition 

rate  of  the  continuous  time  Markov  chain  Z(t)  from  state  (u,i)  to 
state  (s,k)  is  denoted  by  For  each  state 

'V  * 'X,  * 

(s,k)  in  1^2  » we  will  let  H(s,k)  denote  the  set  of  states  in  which  can 
transit  directly  to  the  state  (s,k).  The  appropriate  initial  distribution 
will  be 


n(s,k,0)  = 


(v.j)c  H(s,k) 


(w.h)  cA„ 


for  (s,k,0;  L A, 

(4.3) 
otherwi se 


n(s,k,l)  = 0 


For  the  general  case,  we  need  to  calculate  the  taboo  probability 
A n which  is  defined  to  be  the  probability  that  starting  from 

state  (u,i)  we  can  reach  state  (s,k)  without  passing  through  any  states 

in  A.|  and  during  the  intermediate  steps,  from  the  transition  maxtrix  of 

% 

the  imbedded  Markov  chain  of  Z (t).  Furthermore,  we  define  ^ ^ f*^'jo  .j 
to  be  <5  j when  both  (u,i)  and  (s,k)  are  in  A^.  Let 


c(s,k) 


. z 


P(vj) 


(v^j  ) £ A^ 
(u,i)  ^A.| ~^2 


A^,A^  ^ (u,i)(s,k) 


for  (s ,k)  e A, 


The  appropriate  initial  distribution  will  be 


Consider  the  case  where  the  passage  time  terminates  upon  the  tag  job 
transiting  from  server  x to  server  y. 
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(4.4) 


c(s.k) 


(u.i) e A, 


c(u,i ) 


n(s,k,0)= 


for  (s,k,0)  e 


otherwise 


n(s,k.l)  = 0 


it 

After  obtaining  the  appropriate  initial  distribution  of  Z (t),  let  us 
start  to  solve  for  the  distribution  of  the  hitting  time  to  the  absorbing 
state  r for  each  state  of  Z (t).  Let 

F%(t)  be  the  holding  time  distribution  function  at  state  w,  and 

Q^{t)  be  the  distribution  function  of  the  hitting  time  to  absorbing 
state  r when  starting  from  state  w.  Then  decomposing  over  the  holding  time 
and  possible  next  states  of  w,  we  get 

G;^(t)  = F;jj(t)  * ( fri' 


&v-(t-S)  dF'\^(S)  + F'v-(t) 

wv  V w'  ' wr  w'  ' 


(4.5) 


where  ZZI  means  summation  over  all  states  in  the  state  space  of  the  Markov 

'Xy.'X. 

vf  r 

chain  except  r,the  alDSorbing  state. 

Assume 


F*(S)  = f{F.(t)} 
w w 

= /"  e"^^  dF^,(t) 

0 w 

MS)  = f{G^(t)} 

= e'^^  dG^(t) 

^ w 
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i .e. 


* * * 

F^(S)  and  (^(S)  are  the  Laplace-Stieltjes  transforms  of  F'\,(t)  and 
w w w 


★ 


(»x,(t),  respectively, 
w 


Taking  Laplace-Stieltjes  transform  on  both  sides  of  (4.5)  we  get 


6T(S)  = F*(S)  (E  gX  (S)  + P^%  ) 

E (P'V'\/  F^(S)  - 6^^)  “ -P'\>'v  ^o.(S) 

wvw  wv  V wrw 

v+r 


where  6^^  _ r 1 
w V ■ ^ 


0 


if  W = V 

otherwise 


(4.6) 

(4.7) 


Equation  (4.7)  represents  a system  of  linear  equations  with  finite  dimension. 

After  solving  for  G*  (S),  we  can  get  G^(t)  by  taking  the  inverser  Laplace- 
w 

Stieltjes  transform. 

Finally,  we  get  the  passage  time  distribution  function 
G(t)  = Z 


£ A* 


n(w)  G^(t) 
w 


Before  closing  the  section,  we  would  like  to  point  out  the  fact  that 
if  we  only  tag  the  job  from  a specific  class,  we  can  obtain  the  passage  time 
distribtuion  for  a specific  class  of  jobs  directly  by  following  the  same 
procedure. 
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5.  ILLUSTRATIVE  EXAMPLE 

Consider  the  simple  queueing  network  model  shown  in  Fig.  5.1.  This 
model  is  often  encountered  in  modeling  multi  programmed  computer  systems. 

Under  this  interpretation  the  two  service  stations  will  be  referred  to  as 
the  CPU  and  I/O  unit  and  the  implication  of  a closed  queueing  network  is  that 
the  degree  of  multiprogramming  is  fixed.  Upon  completion  of  service  at  the 
CPU,  the  job  rejoins  the  tail  of  the  queue  in  the  CPU  with  probability 
and  that  in  the  I/O  unit  with  probability  l-'P-  Neither  the  CPU  nor  the 
I/O  unit  is  subject  to  preemption  and  both  follows  the  first-come  first- 
served  discipline.  The  passage  time  of  interest  will  be  the  passage  time 
denoted  by  T which  is  measured  from  entrance  into  the  CPU  queue  until 
completion  of  the  service  at  the  I/O  unit  and  rejoining  the  queue  at  the 
CPU.  Also  possible  of  interest  in  the  model  will  be  the  passage  time  denoted 

•k 

by  T defined  as  the  time  measured  from  entrance  to  the  queue  at  the  CPU  after 
I/O  completion  until  entrance  to  the  queue  at  the  I/O  unit. 

For  ease  of  illustration,  we  will  assume  there  is  only  one  class  of  jobs 
in  the  system.  The  state  variable  1 (t)  can  now  be  simplified  into 

2*(t)  = (0Q(t),  Q^(t),  K(t),  I(t)) 

where 

0Q(t)  = the  number  of  jobs  waiting  or  being  served  at  the  CPU 

Q.|(t)  = the  number  of  jobs  waiting  or  being  served  at  the  I/O  unit 

K(t)  = position  of  the  tag  job  in  the  network  counting  from  the 
tail  of  the  CPU  queue  toward  the  head  of  the  I/O  queue 

I(t)  = indication  of  whether  the  sy:tem  ever  passing  though  the 

★ 

states  in  B.j  as  defined  below 
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We  can  further  simplify  the  state  variable  using  the  fact  that  the 

total  number  of  jobs  are  fixed.  After  simplification,  the  state  variable 
% 

Z (t)  will  be  defined  as 
% 

Z (t)  = (QQ(t),  K(t),  I(t)) 

★ 

with  state  space  E and  forms  a continuous  time  Markov  chain.  For  this 

★ 

model  the  state  space  E is  given  by 

E={(j,  k,  i)  : 0_<j_<N;  l_<k^N;  0_<i_<l} 

where  N is  the  number  of  jobs  in  the  network. 

% 

Recall  the  sequence  of  refinements  needed  to  obtain  Z (t)  described  in 
section  4.  The  sets  , A^.  defining  the  start  point  and  end  point  of  the 

passage  time  T of  the  tag  job  in  the  stochastic  process  Z(t)  = (QqCt),  K(t)) 
are  given  by 

A^=  = {(i.N;  ; 0<  i <N} 

= {(0,2)  . (2,1)} 

A2  = B2  = {(i,l)  : 0<  i <N} 

= {(1.1)  , (2,1)} 

* * * . 

and  the  corresponding  A2  , B^  and  B2  in  Z (t)  are 

A2*  = {(i.1.0)  : 0<  i£N  } 

= {(1.1.0)  , (2,1,0)} 

B2*  = {(i.1.1)  : 0<  i <N} 

= {(1.1.1).  (2.1.1)} 

B*  = {(i.2,0)  : Osi  < N} 

= {(0,2,1)  , (1,2,0)} 

The  absorbing  state  is  formed  by  lumping  the  states  in  B2  , i.e.  (1.1,1)  and 
(2,1,1),  together. 
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Now  we  go  through  the  refinement  procedure  step  by  step  using  a very 

simple  case  where  N = 2 jobs.  In  Fig.  5.2,  we  show  the  state  transition 

% 

diagram  of  Z(t)  = (QQ(t),  K(t)).  There  are  six  states  in  total.  After 

adding  the  component  I(t)  to  the  state  variable,  the  state  transition 

diagram  is  shown  in  Fig.  5.3.  There  are  12  states  in  total.  Only  the  11 

states  reachable  from  the  permissible  initial  states  which  are  the  states  in  A* 

★ 

are  shown.  Finally  after  merging  the  states  in  62  into  an  absorbing  state 
r , the  transition  diagram  of  the  Markov  chain  is  shown  in  Fig.  5.4.  The 

I * I * I 

corresponding  A^  s and  B^.  s or  A^  s and  B^.  s are  also  indicated  in  each 
figure.  For  comparison,  in  Fig.  5.5  and  5.6,  we  indicate  the  A.'s  and  B.'s 
of  Z(t)  and  A^  's  and  B^.  's  of  the  corresponding  Z (t)  on  their  transition 
diagrams  for  passage  time  T*  respectively. 
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Fig.  5.5:  State  transition  diagram  of  Z(t)  = (QQ(t),  K(t)) 

and  subsets  A^.  and  for  passage  time  T* 


/ N 

/ \ 


Fig.  5.6:  State  transition  diagram  of  I (t)  and  subsets 

if  it  ^ 

Ap  and  B,  for  passage  time  T 


To  simplify  the  notation  we  will  relabel  the  states  as  shown  in 


Table  5.1 


— 

Relabelled  state 

Original  state 

'X/ 

0 

r 

1 

U.1.0) 

2 

(2.1.0) 

3 

(2.2.0) 

4 

(0.1.0) 

5 

(1,2.0) 

6 

(0.2,1) 

Table  5.1:  Relabelling  Table 


Assume  the  means  of  the  exponential  service  time  distributions  at  the 
CPU  and  I/O  unit  are  and  ;|j-,  respectively.  We  get  the  following 


infinitesimal 

generator  [11] 

, Q. 

for 

the  continuous 

time  Markov 

chain 

( 

0 

0 

0 

0 

0 

0 

0 

0 

-(u+(l-tp)X) 

0 

y 

(l-'l^)X 

0 

0 

0 

(l-i^)A 

-X 

ipX 

0 

0 

0 

0 

0 

\liX 

-X 

0 

(l-ip)  X 

0 

0 

0 

0 

0 

-y 

y 

0 

y 


0 


0 0 


0 


-lp+(l-i|;)X)  (l-ii^)X 


and  the  transition  matrix,  P,  of  the  imbedded  Markov  chain  can  be  obtained 
from  Q as  [11]. 


1 

0 


0 

0 


0 0 0 
0 y+(l-'|')A  y+(l-if))X 


0 

0 


0 

0 


P=[P. .] = 
■■  IJ 


0 (1-iJ^) 

0 0 

0 0 

y 0 

y+(l-i/))A 

1 0 


0 

'P 

0 

0 

0 


'p 

0 

0 

0 

0 


0 0 0 


0 (1-ij;)  0 


0 1 
0 0 
0 0 


0 

(1-<^)A 

y+(l-'j^)> 


Let  F^(tl  be  the  holding  time  distribution  of  state  i 

6^.(t)  be  the  hitting  time  distribution  to  state  0 from  state  i, 

* ★ 

and  F^. (S)  and  G^. (S)  be  the  corresponding  Laplace-Stieltjes  transforms, 
respectively,  as  before. 


Since  the  holding  time  is  exponentially  distributed. 


F*(S) 


A. 

S+q^. 


where  q^.  = -q^^ 

Furthermore,  let 

R = [r.j,  1 < i , j < 6] 

where  r. . = P. . F. (S)  - 6. . 

1J  ij  V ' ij 
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then  clearly 


^ .. 


R = 


-1 

S+X 

0 

0 

0 

0 


-1 

ipX 

S+X 

0 

0 

0 


S+(J) 

iJ;X 


S+X 

-1 


0 

0 

0 


S+(j) 

0 

0 

-1 

0 

0 


0 

0 

S+X 

y 

S+y 

-1 


0 

0 

0 

S+0 

-1 


where  = y + (l-ii/)X 


From  equation  (4.7),  we  get 


S*(S) 

0 

G2*(S) 

0 

G3*(S) 

= 

0 

G4*IS) 

0 

G5*(S) 

-9l 

g/is) 

-^2 

where 

9l  = ^ 

and 

9z  " sfe 

After  simplifications,  we  get 

G*(s)=  ^*"23  *"35 '^'"21  '"lay  *"21  *"l4 

^ ^ ■ *"23  *"32  ■ '"21  *"13  *"32 

G*(S)=  r^3  r32  f;(S)  + (r^3  r3g  + r,^  r45)(g^  + 
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Both  (S)  and  G2  (S)  are  rational  functions  of  S.  After  simplification 
and  taking  inverse  transform,  we  can  get  G^(t)  and  G^Ct). 


From  theorem  3.1,  we  obtain  the  initial  distribution 


nu)  = 


X(l-iJ;)+y 


Hence,  the  distribution  of  passage  time  T is 

" X(l-i!+u  *"1^^^  ^ irr^THl 


Let  us  look  at  the  case  where  X=l,  y = 0.5,  and  i|;=0.75.  The  values 
of  the  parameters  are  the  same  as  those  used  by  Iglehart  and  Shedler  [14]  where 
the  regenerative  simulation  is  used  to  estimate  the  passage  time  distributions 
of  the  same  system.  After  simplification,  we  get 


32$^  + 56$  + 15 


(64S^  + 1765^  + 124S  + 15) (S + 1 ) (2S + 1 


G*(S)=  (6S+5)  (4S  + 3) 

^ (64S  + 1765^  + 124S  + 15)(2S+1)^ 


and 

n(l)  = 1 
IU2)  ■ I 

Finally,  by  taking  the  inverse  Laplace  transform,  we  get  the  passage 
time  distribution. 

G(t)  = 1.61832(l-e‘°-^^^^^^®^)  + 0. 121284(1 

+ 0. 0234787 (1-e"^-^^^^^^^)  - 0.0793388(l-e‘°-^^) 

+ 0.0303030(l-e"°-^^  - e‘°-^^) 

which  is  a combination  of  exponential  and  Gamma  distributions. 
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6.  NUMERICAL  APPROXIMATION  ON  PASSAGE  TIME  DISTRIBUTIONS  FOR  THE  CLOSED 
QUEUEING  NETWORKS 


From  the  example  in  the  previous  section,  we  can  see  as  the  complexity 
of  the  queueing  network  increases,  the  number  of  states  increases  rapidly 
and  the  manipulations  on  transfer  functions  become  very  tedious.  In  this 
section,  a numerical  approximation  method  is  proposed.  The  basic  idea  is  to 
estimate  the  discrete  approximation  of  the  passage  time  distribution  instead. 
Under  the  discrete  approximation,  all  the  convolution  integrals  become 
recurrence  relations,  so  the  enumeration  on  computers  is  straightforward. 


Recall  F2;(t)  is  the  holding  time  distribution  function  at  state  w and  is 

w 

exponentially  distributed.  Assume  that  its  mean  is  . If  we  discretize 

the  density  function  dF^(t)/dt  into  a string  of  impulses  separated  by  d as 
in  Fig.  6.1  and  set  the  magnitude  of  each  impulse  equal  to  the  area  under  the 
density  function  on  its  left  hand  interval,  we  get  a geometric  distribution 


-X%(k-l)d  -X^kd 

Pp;;:k}  = { ® 

0 


for  k > 1 
for  k = 0 


As  d decreases,  the  accuracy  increases  and  the  efficiency  decreases.  In 
Fig.  6.1a  and  6.1b,  we  display  the  density  function  dFjj(t)/dt  and  its  discrete 
approximation  Pp^(k),  respectively.  Note  the  reason  why  the  discretization 
has  a delay  or  shift  d comes  from  the  fact  that  no  probability  mass  should 
concentrate  on  the  origin  for  holding  time  distributions  in  general. 
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(a)  density  function  dF'\,(t)/dt 

w 


Fig.  6.1:  Density  function  dPV/(t)/dt  and  its  discret 

w 


approximation 


Let  Pgrji  (j)  be  the  discrete  approximation  of  the  hitting  time  density 


Gw 

function  dGjo(t)/dt. 


The  discrete  version  of  (4.5)  now  becomes 


\ 0 for  w = r 


> 1 (6.1) 


and 


Pa5i(0)=(°  p 


, ^ % 'Xj 

1 for  w = r 


Furthermore,  using  the  fact  that  the  holding  time  is  geometrically 
distributed,  the  above  equations  for  Pq^(j)  can  be  further  simplified  into 


E * P"u%  p -\,( j_l ) 
* wv  Gv'''  ' 


vcE 


for  w f r 


- 'V  0, 

for  w = r 


j>l 


(6.2) 


and 


Pg;^(o)  = 


0 

1 


£ 'X'  \ 'Xj 

for  w f r 

. 'Xi  'Xj 

for  w = r 


where 


■k 

W V 


-A'\^d 

(1-e 


w V 


-X'vd 

w 


£ 'Xj  , 'Xj 

for  V f w 


j.  'Xj  'Xj 

for  V = w 


Clearly,  the  PqJS/(j)'s  can  now  be  solved  recursively  for  starting 

with  Pq5j(0)  for  all  weE*. 

Hence,  Pg(i)  the  discrete  approximation  of  the  passage  time  density 
function,  dG(t)/dt,  can  be  expressed  as 
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16.3) 


Pr(i)  = ^ 


A 

w = e A., 


niw)  Pgjo(i) 


where  the  initial  distribution  n ( w)  is  given  in  (4.3)  or  (4.4). 


By  linear  interpolation,  we  get 


G(t) 


^ ^ PM) 

j=l 


t-id 

d 


Pg(H1) 


for  id  < t < (i+1 )d 


Finally,  let  us  apply  the  approxi (nation  technique  to  estimate  the 
response  time  distribution  of  the  multi  programmed  computer  system  model  in 
the  previous  section.  In  table  6.1,  we  tabulate  not  only  the  approximate 
percentile  response  time  distributions  under  two  different  d values,  0.05 
and  0.1,  but  also  the  analytic  result  obtained  in  the  previous  section  and 
the  simulation  result  obtained  by  Iglehart  and  shedler  [14].  The  approximation 
leads  to  very  satisfactory  results  especially  when  the  size  of  d is  small. 
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Companson  of  analytic  result,  numerical  approximation 
and  simulation  result  on  percentile  response  time 


7.  APPROXIMATE  PASSAGE  TIME  DISTRIBUTIONS  FOR  THE  OPEN  QUEUEING  NETWORKS 

The  major  problem  encountered  on  tackling  open  queueing  network  is 
that  the  state  space  of  the  network  is  infinite.  We  first  use  a specific 
passage  time  which  is  the  response  time  of  a job  to  illustrate  the  similarities 
and  differences  encountered  in  handling  open  queueing  networks  and  closed 
queueing  networks.  We  assume  that  the  network  has  already  been  in  steady 
state  at  t = O"  and  a job  arrives  at  t = O'.  We  will  use  this  newly  arrival 
job  as  our  tag  job.  The  state  variable  of  the  network  will  still  take  the 
form 

% 

Z (t)  = (S(t),  K(t).  I(t))  t>0 

★ ★ *■ 

as  that  in  the  closed  queueing  network.  The  set  A2,  , and  B2  will  also 

★ 

be  defined  as  before.  Actually,  the  set  B^  is  redundant  in  this  case,  but 

★ 

for  more  general  passage  time  it  is  indeed  required.  The  states  in  which 

correspond  to  all  possible  start  points  of  response  times  have  the  form 

(s,k,0)  where  the  k-th  component  of  s describes  a job  which  may  be  considered 

to  be  a new  arrival  to  the  queueing  network.  To  be  more  specific,  a job 

can  be  considered  as  the  newly  arrival  job  to  the  queueing  network,  if  it 

is  the  last  job  in  the  queue  of  a FCFS  service  center,  or  the  job  currently 

being  served  in  the  first  stage  of  a LCFS  service  center,  or  any  job  in  its 

★ 

first  stage  of  service  in  a PS  or  IS  service  center.  The  set  B2  which  corre- 
sponds to  all  possible  end  points  of  response  times  and  will  eventually  be 
lumped  into  an  absorbing  state  consists  of  all  states  which  can  be  the  next 
states  immediately  after  an  exit  of  the  tag  job  from  the  queueing  network. 

* '\j 

The  states  in  B2  have  the  form  (s,0,l).  Notice  we  set  K(t)  equal  to  zero 
to  indicate  that  the  tag  job  has  left  the  network  by  time  t. 

i 
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We  now  proceed  to  evaluate  the  initial  distributions  of  the  states  in 

it 

Ap  under  our  previous  assumptions.  Let  us  for  the  moment  return  to  the  state 

^ 'Vi 

description  of  S(t)  with  state  space  E cited  in  section  3.  Let  H(s,k)  be  the 

state  that  leads  to  s after  an  external  arrival  whose  class  and  enterning 
service  center  is  described  by  the  k-th  component  of  s where  both  s and  H(s,k)eE. 

Here  we  implicitly  assume  that  the  k-th  component  of  s can  indeed  represent 
a new  arrival  to  the  network.  Notice  given  s and  k,  the  preceding  state  H(s,k) 

is  unique.  To  be  more  precise,  if  the  arrival  enters  from  a FCFS  or  LCFS 

service  center,  H(s,k)  is  obtained  by  deleting  the  k-th  component  of  s and  if 

the  arrival  enters  from  a PS  or  IS  service  center,  H(s,k)  is  obtained  by 

decreasing  the  k-th  component  of  s by  1 and  deleting  it  if  it  drops  to  zero 

after  decrementation.  Let  C(s,k)  be  the  service  class  of  the  new  arrival 

described  by  the  k-th  component  of  s and  T(s,k)  be  its  entering  service  center, 
n 

Recall  that  P^.  ^ is  defined  in  section  3 to  be  the  probability  that  the  new 

arrival  will  be  of  class  r and  entering  from  service  center  i.  Clearly,  the 

transition  probability  from  H(s,k)  to  s conditioning  on  a new  arrival  will  be 
A 

^T(s  k)  C(s  k)‘  til®  probability  that  conditioning  on  a new  arrival 

with  class  C(s,k)  and  entering  service  center  T(s,k),  the  network  will  be  in 
state  s is  given  by 

p„(U)  - (7.1) 

where  P(H(s,k))  can  be  evaluated  by  Theorem  3.1.  Furthermore,  the  above 
probability  is  in  fact  the  initial  probability  distribution  of  the  state 
(s,k,0)  in  Ap  of  the  augmented  state  space  E , i.e.  n(s ,k,0)=PQ(s ,k) . 

Following  the  same  argument  for  closed  queueing  networks,  we  can  derive 

a 

the  same  equations  as  in  section  4 for  &\<(t)  and  the  hitting  time 

w w 


41 


distribution  and  its  Laplace-Stieltjes  transform  for  each  state  w in  E , 
respectively  • That  is  in  time  domain 

&v(t)  = A r K'v-  &^-(t-$dF'^.(S)  + F'v.(t)  (7. 21 

w 0 wv  V w'  ' wr  w'  ' ' ' 

in  transform  domain 

fC(S)  &v(S)  = -P^'\.  F*(S)  (7.3) 

y.4.^  wvw''  wv  v wrw''  ' ' 

vfr 

But  the  system  of  equations  appeared  in  (7.3)  now  has  infinite  dimensions. 

* o- 

Clearly,  we  only  need  to  consider  a finite  subset, D ={(S,K,i ) :SeD) , in  the  state 
% 

space  of  I (t)  where  D is  the  most  frequently  occurred  states  of  Z(t)  sach  that 

% 

the  total  steady  state  probability  of  Z(t)  in  d , which  can  be  derived  from 
theorem  3.1,  is  close  to  1 . By  neglecting  the  other  states  not  in  that 
subset,  the  system  of  equations  reduces  to  a finite  set  of  linear  equations. 

We  note  that  after  the  state  reduction  the  transition  probabilities  should  be 
normalized. 

To  simplify  the  problem,  we  can  again  apply  the  discretization  technique 

introduced  in  section  6 to  obtain  recurrence  relations  for  the  discrete 

approximation  of  the  density  function  of  the  hitting  time  for  each  state  w. 

The  recurrence  relations  are  similar  to  (6.2)  except  that  now  we  have  infinite 

simultaneous  recurrence  relations  due  to  the  infinite  state  space.  Using 

* 

the  same  principle  cited  above  to  reduce  the  state  space  to  D , the  problem 
becomes  solvable.  The  discrete  approximation  of  the  density  function  of  the 

response  time  is  again  give  by  (6.3)  except  that  ^ ^ becomes  an  infinite 

(s,k,0)cA. 

★ ^ ★ 

sunmation.  By  considering  only  the  intersection  of  and  D , 
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the  sumnation  becomes  a finite  summation.  As  before,  all  the  probabilities 
should  be  normalized  after  the  state  reduction.  As  long  as  the  traffic  in 
the  network  is  not  heavy,  all  the  infinite  suimiations  appeared  above  will 
converge  very  quickly  and  only  a reasonable  number  of  states  need  to  be 
considered. 

Alternatively,  we  can  approximate  an  open  system  by  a closed  system 
and  apply  the  result  in  section  6 directly.  Since  the  queue  length  distribution 
of  the  system  is  known,  we  can  calculate  the  quantile  of  queue  length 
distribution  with  no  difficulties.  Let  N be  the  95%  or  99%  quantile  of  the 
distribution  of  the  total  number  of  jobs  in  the  system  depending  upon  the 
accuracy  desired.  That  is  to  say  95%  or  99%  of  the  time  the  number  of  jobs 
in  the  network  is  less  than  or  equal  to  N.  So  if  we  consider  a queueing 
network  which  is  identical  to  the  original  network  in  all  respects  except 
that  the  arrival  process  is  shut  down  when  the  number  of  customers  in  the 
system  is  equal  to  N,  the  performance  difference  should  be  very  minor.  The 
new  queueing  network  can  be  viewed  as  a closed  queueing  network  with  N 
customers  as  shown  in  Fig.  7.1.  In  fact,  when  N the  performance 
the  two  network  models  becomes  identical.  As  long  as  the  traffic  in  the  network 
is  not  heavy,  N will  be  reasonably  small. 

Let  us  consider  an  example.  The  example  chosen  is  an  open  two  server 

queueing  model.  As  in  the  closed  two  server  queueing  model,  the  servers  can  be 
interpreted  as  CPU  and  I/O  unit,  respectively.  Now  the  total  number  of  jobs  being 
activated  is  assumed  to  be  variable.  The  open  queueing  model  and  its  approximate 
closed  queueing  model  are  given  in  Fig.  7.2a  and  7.2b  respectively.  The  passage 
time  under  interest  is  the  response  time  which  is  the  time  between  arrival  and 
departure  of  a job  to  the  system.  In  Fig.  7.2b,  the  equivalent  passage  time  in 
the  closed  network  is  the  time  measured  from  entrance  into  the  CPU  queue  from 
the  source  until  the  entrance  into  the  source  after  service  completion  at  the 


I/O  unit. 
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queueing 

/ 

network 

^ 

(a)  open  queueing  network  model 


the  original 

queueing  network 
in  U) 


©=> ‘ 

(b)  approximate  closed  queueing  network  model 


Fig.  7.1:  An  Open  Queueing  Network  Model  and  Its  Approximate 

Closed  Queueing  Network  Model 


(aj  open  queueing  network  model 
source 


(b)  approximate  closed  queueing  network  model 

Fig.  7.2:  An  Open  Two  Server  Queueing  Model  and  Its  Approximate 

Closed  Queueing  Network  Model 
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In  Table  7.1,  we  compare  the  simulation  result  and  the  approximation  results 


> 


on  percentile  response  time  when  tp  = 0.3,  a = 0.1,  A=  0.1,  = 1 and  ~ 

The  simulation  result  is  obtained  via  regenerative  simulation  where  not  only  the 
point  estimation  but  also  the  95%  confidence  interval  are  provided.  We  evaluate 
the  approximate  percentile  response  time  under  two  different  cut  off  values,  i.e. 
N = 3 and  4.  Furthermore,  the  interval  of  discrete  approximation,  d,  is  chosen 
to  be  0.05.  As  we  can  see  the  approximations  obtained  ;under  different  N's  have 
little  difference  since  the  system  seldom  has  more  than  3 jobs  simultaneously. 

The  approximation  results  are  also  very  close  to  the  simulation  results.  In 
Table  7.2,  we  show  the  approximation  results  for  N = 3 and  4 when  the  value  of 
d is  doubled.  The  results  are  still  very  nice.  In  Table  7.3,  we  compare  the 
simulation  result  and  the  approximation  results  on  percentile  response  time 
when  ip  increases  to  0.75.  The  traffic  intensity  increases  in  this  case,  so  does 
the  appropriate  cut  off  value  on  the  arrival  process.  We  consider  two  different 
approximations  for  N = 7 and  8,  respectively.  The  value  of  d is  chosen  to  be 
0.05.  Again,  not  only  the  approximate  results  under  different  cut  off  values 
have  very  little  difference,  but  also  they  are  close  to  the  simulation  result. 


Percentile 

Simulation 

approximation 

N=3 

N=4 

{R<  1} 

0.0758  - 0.0059 

0.0736 

0.0723 

{R<2} 

0.227  - 0.010 

0.225 

0.221 

IR<  3} 

0.384  + 0.012 

0.386 

0.379 

{R<4} 

0.523  + 0.013 

0.528 

0.520 

{R<5} 

0.640  + 0.013 

0.644 

0.634 

{R<6} 

0.729  + 0.012 

0.734 

0.724 

{R<7} 

0.798  + 0.011 

0.802 

0.793 

{R<8} 

0.847  + 0.010 

0.854 

0.846 

{R<9} 

0.884  + 0.009 

0.892 

0.885 

{R<  11} 

0.935  + 0.007 

0.941 

0.936 

•* 

Table  7.1:  Percentile  Response  Time  Whenij;=  0.3,  a = 0.1,  X = 0.1 

Pi  =1,  U2  - 0.5, and  d = 0.05 
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roximation 


Percent! le 

N-3 

N=4 

P{R<  1} 

0.0695 

0.0682 

P{R<  2} 
P{R<  3} 

0.219 

0.378 

P{R£4) 

0.520 

0.512 

P{R<  5} 

0.636 

0.627 

P{R<  61 

0.727 

0.718 

P{Rj^7) 

0.797 

0.788 

P(Rj;  81 

0.849 

0.841 

P(R<  91 

0.888 

0.881 

P{R^11} 
< 

0.939 



0.933 

Table  7.2:  Percentile  Response  Time  When ’>];  = 0.3 

0.1,  u,  = 1 , u.,  = 0.5  and  d = 


r 

approximation  I 

Percentile 

Simulation 

fF7 

m [ 

P{R  < 1} 

0.0202  + 0.0026 

0.0208 

P' R < 2} 

0.0755  + 0.0053 

0.0752 

0.0749  1 

p:r<  3} 

0.148  + 0.0076 

0.148 

0.147  i 

P-R^5} 

0.304  + 0.011 

0.306 

0.305  1 

P‘P  < 7} 

0.443  + 0.013 

0.448 

0.447 

PIR  < 9} 

0.562  + 0.013 

0.564 

0.562 

P^R<  11} 

0.653  + 0.13 

0.653 

0.652  I 

P{P.^13} 

0.723  + 0.013 

0.723 

0.721 

P1R<  16} 

0.797  + 0.012 

0.799 

0.797 

P{R<  20} 

0.867  + 0.010 

— 

0.865 

0.864 

lable  7.3:  Percentile  Response  Time  When  = 

> = 0.1 , u,  = 1 , u„  = 0.5,  and  d 


8.  A CASE  STUDY  ON  INTERFAULT  TIME  DISTRIBUTIONS  AND  RESPONSE  TIME 
DISTRIBUTIONS  OF  MULT I PROGRAMMED  COMPUTER  SYSTEMS 

We  now  proceed  to  investigate  the  distribution  and  density  functions  of 
two  important  quantities  encountered  in  analyzing  the  performance  of  a multi- 
progratmed  computer  system,  namely  interfault  time  and  response  time.  Both 
quantities  are  measured  in  real  time.  The  number  of  acti ve  processes  will  affect 
the  interfault  time  and  response  time  on  two  folds.  It  not  only  explicitly 
implies  the  contention  level  on  the  processors  but  also  implicitly  implies 
the  contention  level  on  main  memory.  The  amount  of  memorv  allocated  will  have 
a drastic  effect  on  the  number  of  instructions  executed  between  page  faults, 
i.e.  the  virtual  interfajlt  time.  In  order  to  capture  the  memory  effect  on 
the  performance,  we  will  take  the  approach  of  hierarchical  modeling.  In  the 
first  level  of  modeling  hierarchy  we  only  consider  the  interactions  between 
the  CPU  and  paging  device.  The  closed  two  server  queueing  model  in  Fig.  8.1a 
is  used  to  represent  the  CPU-PGU  subsystem.  The  interfault  time  of  a process 
is  defined  to  be  the  time  between  two  consecutive  epochs  that  the  process 
enters  the  CPU  queue  after  receiving  service  from  the  paging  device.  The 
stoppage  of  CPU  processing  can  either  be  due  to  the  expiration  of  allocated 
time  slice  or  a page  fault.  In  the  first  case,  the  process  rejoins  the  CPU 
queue  and  in  the  second  case,  it  joins  the  queue  of  the  paging  device.  In 
the  second  level  of  modeling  hierarchy,  we  consider  the  interaction  between 
the  CPU-PGU  subsystem  and  file  I/O  device.  Another  closed  two  server  queueing 
model  in  Fig.  8.1b  is  used  for  this  case.  The  response  time  of  a process  is  defined 
to  be  the  time  between  two  consecutive  time  epjchs  that  the  process  passes 
through  the  self  loop  of  the  CPU-PGU  subsystem.  That  is  to  say  a transfer 
through  the  self  loop  can  be  viewed  as  a termination  of  a process  and  entering 
of  a new  process  at  the  same  instant. 
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(CPU)  (page  I/O) 


(a)  CPU-PGU  model  (first  level) 


(b)  Multi  programmed  computer  system  model 
(second  level) 

Fig.  8.1:  Hierarchical  Models  of  a Mul ti programmed 

Computer  System 
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The  virtual  interfault  time  depends  on  the  program  behavior.  As  studied 
by  Chamberlin,  Fuller  and  Lin  [30],  the  mean  virtual  interfault  time,  q,  can  be 
expressed  as 


q = 


2S 


where 


M: 

main  memory  size 

d: 

the  number  of  pages  that  provides  the  process 
its  largest  possible  life  time 

with  half  of 

S: 

the  expected  virtual  interfault  time  when  the 
allocated  a memory  space  d 

process  is 

n: 

degree  of  mi Iti programming 

We 

will  consider  two  types  of  program  behavior  as 

considered  in  [30],  [31] 

Type  1 : 

S = 25  ms 

d = 50  pages 

Type  2: 

S = 20  ms 

d = 60  pages 

Clearly,  Type  1 programs  lead  to  better  performance.  We  will  let  N denote  the 
total  number  of  processes  being  activated,  i.e.  the  total  number  of  processes  in 
the  CPU,  page  and  file  I/O  devices.  Furthermore,  we  assume  that  processes  in  the 
file  I/O  queue  will  be  swapped  out  from  memory.  Note  n denotes  the  total  number 
of  processes  contending  for  memory,  i.e.  the  total  number  of  processes  in  the  CPU 
and  paging  device  which  is  usually  referred  to  as  the  degree  of  multiprogramming. 

We  begin  with  the  first  level  model  in  Fig.  8.1a.  Let  us  assume  that  the 
total  memory  size  is  128  pages  and  the  time  slice,  t^,  for  each  job  is  50  msec. 
The  mean  service  time,  tp^,  of  the  paging  device  is  assumed  to  be  5 msec.  The 
mean  CPU  overheads  for  a page  fault  and  a process  switching,  referred  to  as  Op 
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and  0^,  are  assumed  to  be  0.2  and  0.3  msec.,  respectively.  The  mean  time 
between  service  completions  at  CPU  is  approximately 

^CPU  ^ tg+  q °s  (l-'l')Op 

where 


The  numerical  approximation  technique  in  section  6 is  employed  in  the  following 
study.  In  Fig.  8.2a  and  Fig.  8.3a,  we  plot  the  interfault  time  distribution 
functions  and  density  functions  for  type  1 programs  when  n=2,  4,  6 and  8, 
respectively.  The  time  is  normalized  with  respect  to  mean  virtual  interfault  time. 
In  Fig.  8.2b  and  8.3b  we  plot  the  same  curves  for  type  2 programs.  As  we  can 
see  when  the  degree  of  multiprogramming  increases,  the  changes  in  the  shape  of 
distribution  functions  or  density  functions  follow  the  same  pattern  for  both 
types  of  programs.  Nevertheless,  for  type  2 programs,  the  performance  deteriorates 
further  as  n becomes  large.  In  Table  8.1  and  8.2,  we  tabulate  the  CPU  utilization 
under  various  values  of  n for  both  types  of  programs.  The  CPU  utilization  for 
type  2 programs  decrease  sharply  as  n goes  to  8.  The  CPU  utilizations  given 
in  Table  8.1  and  8.2  will  be  used  in  the  next  level  of  the  multi  programmed 
computer  system  model  to  determine  the  effective  CPU  execution  time  between  file 
I/O  requests  or  process  completions.  The  effective  CPU  execution  time  is  defined 
to  be  the  CPU  execution  time  devided  by  the  true  CPU  utilization  referred  to  as 


U 


CPU. 
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FIG  8. 2. A INTE 


FIG  8.2.B  INTE 
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degree  of 
multiprogramming 

CPU  utilization 
(including  overhead) 

CPU  utilization 
(true),  Uj.p^ 

1 

0.898 

0.882 

2 

0.979 

0.956 

3 

0.991 

0.961 

4 

0.992 

0.952 

5 

0.989 

0.937 

6 

0.980 

0.914 

7 

0.956 

0.876 

8 

0.904 

0.811 

Table  8.1:  CPU  Utilization  Under  Type  1 Programs 


degree  of 
multiprogramming 

CPU  utilization 
(including  overhead) 

CPU  utilization 
(true) , 

1 

0.870 

0.851 

2 

0.959 

0.931 

3 

0.970 

0.929 

4 

0.961 

0.903 

5 

0.929 

0.854 

6 

0.859 

0.768 

7 

0.749 

0.649 

8 

0.629 



0.526 

Table  8.2:  CPU  Utilization  Under  Type  2 Programs 
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Now  let  us  consider  the  second  level  model  in  Fig.  8.1b.  We  further 
assume  that  t^,  the  mean  CPU  execution  time  of  a process,  is  100  ms  and 
t^,  the  mean  execution  time  between  file  I/O  requests  of  a process,  is  50  ms.  The 
service  time  of  the  file  I/O  device  is  assumed  to  be  50  ms.  The  CPU  overheads  for 
a file  I/O  fault  and  a process  termination  and  initiation,  referred  to  as  0^  and 
0^,  are  assumed  to  be  4 ms  and  0.3  ms,  respectively.  The  mean  service  completion 
time  of  the  CPU-PGU  subsystem  is  approximately 

^CPU-PGU  " lt^  + tJU(,p^j  °c  °f 

where 


In  Fig.  8.4a  and  8.5a,  we  plot  the  response  time  distribution  functions  and 
density  functions  for  type  1 programs  when  N = 2,  4,  6 and  8.  The  response  time 
is  normalized  with  respect  to  the  mean  CPU  execution  time  of  a process.  In 
Fig.  8.4b  and  8.5b,  we  plot  the  same  curves  for  type  2 programs.  As  we  can 
see  as  N increases,  the  changes  in  the  shape  of  response  time  distributions  or 
densities  follow  the  same  pattern  for  both  types  of  programs,  respectively. 

In  Table  8.3,  we  tabulate  the  mean  response  times  for  both  cases.  As  we 
can  see  when  N is  less  than  6,  the  response  time  distributions  or  densities 
under  both  types  of  programs  are  very  close  to  each  other.  When  N further 
increases,  the  performance  under  type  2 programs  deteriorates  faster  than 
than  under  type  1 programs  since  the  paging  device  is  not  fast  enough  to 
support  paging  requests. 

Let  us  change  the  mean  execution  time  between  file  I/O  requests  to  30  msec, 
and  plot  the  same  kind  of  curves  in  Fig.  8.4e,  8.4d  and  Fig.  8.5c,  8.5d  as 
before.  Now  the  file  I/O  request  rate  increases  and  the  file  I/O  device  becomes 
the  bottleneck  of  the  system.  Comparing  Fig.  8.5c  with  8.5a  or  Fig.  8.5d  with  8.5b, 
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we  can  see  the  forms  of  density  functions  change  sharply  from  before.  In 
Table  8.4,  we  tabulate  the  mean  response  times  for  both  types  of  programs. 

Since  processes  contending  for  the  file  I/O  device  do  not  content  for  memory  resource, 
the  performance  difference  between  type  1 and  type  2 programs  become  smaller 
compared  with  the  previous  case. 

To  investigate  the  balancing  of  a computer  system  let  us  keep  the  degree 
of  multiprogramming  to  be  8 and  the  processing  rate  of  the  CPU  and  paging 
device  as  before.  Type  2 programs  are  used  for  illustration.  In  Fig.  8.6a 
and  8.6b,  we  plot  the  interfault  time  distribution  functions  and  density 
functions  under  memory  sizes  of  128,  192,  256  and  320  pages,  respectively. 

The  time  is  normalized  with  respect  to  mean  virtual  interfault  time.  In 
Table  8.5,  we  tabulate  the  mean  virtual  interfault  time  for  each  case.  Although 
the  mean  virtual  interfault  increases  steadily  according  to  almost  linear  rate, 
the  closeness  of  the  distribution  curves  after  memory  size  exceeds  192  pages 
indicates  that  the  system  attains  its  balance  after  the  memory  size  exceeds 
192  pages.  Further  increasing  the  memory  size  only  makes  the  paging  device 
idle  most  of  the  time.  Notice  the  crossing  of  the  distribution  functions  is 
due  to  the  time  slice  control  which  forces  the  long  interfault  time  to  be 
interrupted  and  increases  the  tail  of  the  interfault  time  distribution.  This 
phenomenon  becomes  more  apparent  as  the  memory  size  or  the  mean  virtual  interfault 
time  further  increases.  Apparently,  when  the  memory  size  is  128  pages,  the 
system  is  not  balanced  and  the  paging  device  is  overloaded.  Instead  of  increasing 
memory  size  to  make  the  system  balance,  we  can  increase  the  speed  of  the 
paging  device.  In  Fig.  8.7a  and  8.7b,  we  plot  the  interfault  time  distribution 
functions  and  density  functions  when  service  rate  of  the  paging  device,  u,  is  0.2, 

0.4,  0.6  and  0.8  processes/msec,  respectively.  The  original  service  rate  of 
the  paging  device  is  0.2.  After  we  double  its  service  rate,  the  system  gets  close 
to  balance.  Further  increasing  the  speed  of  the  paging  device  has  marginal  effect 
on  the  performance. 
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FIG  8.4.A  RES 
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N0RMALIZED  RESP0NSE  TIME 


FIG  8.5X  RESP0NSE  TIME  DENSITY 


AiISN3a 
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N0RMALIZED  RESP0NSE  TIME 


N0RMALI2ED  RESP0NSE  TIME 


N 

type  1 program 

2 

326 

4 

540 

6 

766 

8 

1028 

type  2 program 


Table  8.3:  Mean  Response  Time  When  the  Mean  Time  Between 

File  I/O  Requests  is  50  msec 


type  1 program 


type  2 program 


Table  8.4:  Mean  Response  Time  When  the  Mean  Time  Between 

File  I/O  Requests  is  30  msec 


Table  8.5:  Mean  Virtual  Interfault  Time  Under  Various 

Memory  Sizes. 


9.  CONCLUSION 


Queueing  network  models  have  been  used  extensively  in  modeling  computer 
systems  and  computer  communication  networks.  The  situations  where  we  can  get 
analytic  solutions  for  stationary  state  probabilities  have  been  studies  in  the 
past  few  years.  Although  their  average  values  can  be  obtained  through  Little's 
formula,  the  response  time  distributions  or  the  more  general  passage  time 
distributions  have  not  yet  been  solved.  The  distribution  function  can  provide 
us  with  a lot  of  useful  information,  such  as  variance  or  other  higher  moments, 
percentile,  quantile,  etc.  By  transforming  passage  times  into  hitting  times 
of  appropriate  Markov  systems,  we  derive  an  analytical  solution  for  passage 
time  distributions  for  the  same  class  of  closed  queueing  networks  specified 
in  section  3,  where  the  analytic  solution  on  stationary  state  probabilities 
is  available.  Avoiding  matrix  inversion  in  transform  domain  required  by  the 
exact  analysis,  efficient  numerical  approximation  replacing  convolutions  by 
recurrence  relations  is  also  proposed  using  the  concept  of  discretization  of 
the  distribution  functions.  Then  we  consider  passage  time  distributions  for 
the  class  of  open  queueing  networks  specified  in  section  3.  The  result  for 
closed  queueing  networks  is  extended  to  obtain  approximate  passage  time  distributions 
for  open  queueing  networks.  Finally,  we  employ  the  techniques  derived  in  this 
paper  to  study  the  interfault  time  distributions  and  response  time  distributions 
of  multi  programmed  computer  systems.  The  effects  of  program  behavior,  degree 
of  multiprogramming,  size  of  main  memory,  service  time  of  paging  devices,  and 
rate  of  file  I/O  requests  on  the  shape  of  distribution  functions  and  density 
functions  have  been  examined. 
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